Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/22481/17

Structure of Pauses in Speech in the Context of Speaker Verification and Classification of Speech Type

01-Jan-1970 Research 2017 : April - June

Magdalena Igras-Cybulska, Bartosz Ziolko, Piotr Zelasko, Marcin Witkowski

Statistics of pauses appearing in Polish as a potential source of biometry information for automatic speaker recognition\nwere described. The usage of three main types of acoustic pauses (silent, filled and breath pauses) and syntactic pauses\n(punctuation marks in speech transcripts) was investigated quantitatively in three types of spontaneous speech\n(presentations, simultaneous interpretation and radio interviews) and read speech (audio books). Selected parameters of\npauses extracted for each speaker separately or for speaker groups were examined statistically to verify usefulness of\ninformation on pauses for speaker recognition and speaker profile estimation. Quantity and duration of filled pauses,\naudible breaths, and correlation between the temporal structure of speech and the syntax structure of the spoken\nlanguage were the features which characterize speakers most. The experiment of using pauses in speaker biometry\nsystem (using Universal Background Model and i-vectors) resulted in 30 % equal error rate. Including pause-related\nfeatures to the baseline Mel-frequency cepstral coefficient system has not significantly improved its performance. In the\nexperiment with automatic recognition of three types of spontaneous speech, we achieved 78 % accuracy, using GMM\nclassifier. Silent pause-related features allowed distinguishing between read and spontaneous speech by extreme\ngradient boosting with 75 % accuracy.

How to Cite this Article
CC Compliant Citation: Igras-Cybulska, Magdalena, et al. \"Structure of pauses in speech in the context of speaker verification and\nclassification of speech type.\" EURASIP Journal on Audio, Speech, and Music Processing 2016.1 (2016): 18, DOI 10.1186/\ns13636-016-0096-7, http://creativecommons.org/licenses/by/4.0/.
Download Full Text

Call Us: +4 (800) 888-0008

Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/22481/17

Structure of Pauses in Speech in the Context of Speaker Verification and Classification of Speech Type

How to Cite this Article

Links

Contact Us